SCAI TREC-8 Experiments
نویسندگان
چکیده
This working note reports our experiences with TREC-8 on four tracks: Ad Hoc, Filtering, Web, and QA. The Ad Hoc retrieval engine, SCAIR, has been used for the Web and QA experiments, and the ltering experiments were based on it's own engine. As a second entry to TREC, we focused this year on exploring possibilities of applying machine learning techniques to TREC tasks. The Ad Hoc track employed a cluster-based retrieval method where the scoring function used cluster information extracted from a collection of precompiled documents. Filtering was based on naive Bayes learning supported by an EM algorithm. In the Web track, we compared the performance of using link information to that of not using the information. In the QA track, some passage extraction techniques have been tested using the baseline SCAIR retrieval engine.
منابع مشابه
Patent Retrieval in Chemistry Based on Semantically Tagged Named Entities
This paper reports on the work that has been conducted by Fraunhofer SCAI for Trec Chemistry (Trec-Chem) track 2009. The team of Fraunhofer SCAI participated in two tasks, namely Technology Survey and Prior Art Search. The core of the framework is an index of 1.2 million chemical patents provided as a data set by Trec. For the technology survey, three runs were submitted based on semantic dicti...
متن کاملCLARIT TREC-8 Manual Ad-Hoc Experiments
CLARITECH’s submission in TREC-7 demonstrated the utility of document clustering in retrieval. We continued this work in TREC-8, using a clustered document presentation exclusively. We also added significant new functionality to the manual ad hoc user interface, integrating it with an entity extraction subsystem (upgraded and customized for TREC). Extracted entities represent an alternate set o...
متن کاملExperiments on the TREC-8 Filtering Track
For this year's TREC, KDD R&D Laboratories focused on the adaptive ltering experiments of the Filtering Track. The main focus of our research was the development and evaluation of the pro le updating algorithm. Our pro le updating algorithm is based on the query expansion method based on word contribution[1][2]. Given manual feedback, our QE method has achieved high performance in the ad hoc tr...
متن کامل